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1 The Classical Combinatorial Description: 
Configurations and Observables 

Biological molecules that convey information (nucleic acids and proteins) can 
be abstractly modelled as finite sequences of letters chosen from an alphabet A. 
This alphabet can be of 4 letters, when we deal with nucleic acids, of 20 letters, 
when we deal with proteins, or some extended version of those alphabets 
— possibly countable — when some additional information is incorporated 
into the description (eg. assigning exon-intron character to each nucleotide, 
including stereo-chemical and pairing information to each amino-acid, etc.) All 
living organisms use the same alphabet to encode the biological information 
relevant to their survival both as individuals and as species. Individuals of the 
same species use sequences of (almost) fixed length. 

To capture the combinatorial variety of all possible sequences arising in all 
living matter, we use a universal set, the (sequence) configuration space. An 
alphabet A gives rise to a configuration space A* = U^qA". Every possible 
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and imaginable sequence is bijectively mapped to a single point of A*. The 
configuration space has a rooted tree structure carrying thus several natural 
distances: the tree distance counts the number of generations one must go 
back to find a common prefix, the Hamming distance counts the number of 
sites where the residues differ. 

An observable X, taking values in some space X, is a map X : A* ^ X. In 
most situations, the set X is just R or some subset of it. The most elementary 
observables are {0, l}-valued observables also called questions, i.e. indicator 
functions of subsets of A* . 

Having delimited the playground — the configuration space — on which 
all sequences can be represented, chronological changes occurring on sequences 
can be represented as a time evolution on A* . Here the term evolution applies 
mathematically to every time scale to denote a A*-valued continuous time 
process, including 

1 . the accretion processes consisting in duplication of the nucleic acid molecule, 
nucleotide by nucleotide, occurring during cell division viewed as a very 
rapid length increasing time evolution over nearest neighbouring points of 
the configuration space; 

2. the local modifications of the genetic sequence occurring at random epochs 
of the life span of an individual such as point mutations (DNA methy- 
lation, forward or backward slippage, etc.), modifying only few adjacent 
letters, viewed as rapid evolution over closely lying points of the configu- 
ration space (almost) preserving the length of sequences; 

3. the global shuffling of vast regions of the genetic sequence of a given species 
occurring during trans-locations, inversions, independent assortments or 
chiasmata during sexual reproduction viewed as rapid length (almost) 
preserving time evolutions over distant points of the configuration space 
but lying at (almost) the same distance from the root; 

4. the modifications of species sequences occurring over geological time scales 
viewed as length non preserving slow evolution over distant points of the 
configuration space. 

Since the configuration space A* is countable, time evolution is a continuous 
time jump process over A*; when sampled at the instants of occurrence of 
the jumps, this process is a random walk on the tree A*. For definiteness, we 
focus only on evolution of sequences of nucleotides and more precisely of the 
two first local types in the above list. 

Example 1. Let A = {A, C,G,T}. Then A* = U^^^A^, where A'' contains 
the empty sequence (denoted k in the sequel), A^ contains the 4 sequences 
of length 1: A, C, G, and T, etc. The set A^ contains 4^ sequences of length 
N . The sequences of given length can be represented as vertices of a given 
generation of a rooted tree; all vertices but the root have degree |A| + 1 = 5. 
(See Fig.Q]) Sequences AG and GA have tree and Hamming distance 2; AA 
and GA have tree distance 2, Hamming distance 1. 
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Fig. 1. The bijection between the sequence configuration space A* and a rooted 
tree. To the root corresponds the empty sequence k of zero length and no letter. To 
get the sequence mapped to another vertex of the tree, one has to read the letters 
encountered on the path joining the root to this particular vertex. 

2 The Dual Probabilistic Classical Description: States 

The configuration space as playground wliere sequences can be described and 
evolve is very useful, concise, and powerful. Nevertheless, it is much too precise 
to be directly cxploita})l{^: giving a point in A* for a sequence of N residues 
is equivalent to determining one out of |A|^ variables; for human genome for 
instance the latter reads approximately 4^^^*^ , a tremendously huge number. 
Now when examining the DNA sequences appearing in the cells of a multi-cell 
individual, we find that they are identical but for some accidental modifica- 
tions. What is needed is some qualitative description of sequences allowing to 
make statistical comparisons between populations (i.e. sets) of sequences. The 
configuration space is also too vast: the overwhelming majority of elements 
of A* never occur as biologically viable and relevant sequences. Finally, the 
evolution process on A*, even when it starts from a single sequence a 6 A*, 
will introduce some natural randomness on the set of sequences obtained by 
the evolution of a. 

The natural mathematical way to overcome all these difficulties is to deal 
with probability measures, also called states, on sets of sequences. A popula- 
tion of individuals instead of being described by a precise set of elements of 
A* is described by a state p, supported by this particular set, containing all 
useful and relevant statistical information about individual sequences. 

Therefore, the pair consisting of a classical configuration space and a state 
(A* , p) is nothing else than a standard probability space; classical observables 
are nothing else than random variables on this space. 
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3 On the Relevance of Quantum Mechanics in Biology: 
what, how, why? 

What? Mathematically, quantum mechanics is an extension of classical proba- 
bility theory where real random variables are replaced by self-adjoint operators 
over an appropriate Hilbert space and classical states by quantum states. 

Physically, quantum mechanics is a refinement of classical mechanics con- 
taining a fundamental physical constant, the Planck constant h = 1.05457 x 
10"^'* Js. When action values are comparable to h, the laws of classical me- 
chanics do not any longer describe reality satisfactorily. Chemical properties 
like affinities of different atoms to form molecules, stability of matter, confor- 
mational geometry of molecules, etc. rely on quantum mechanical rules. 

How? Biology relies on chemistry. Therefore it is pointless to ask whether 
biology relies on quantum mechanics, it certainly does. The relevant ques- 
tion is how it can. Due to the smallness of h, quantum phenomena become 
in general irrelevant when considering large systems at high temperatures, 
and biomolecules are large warm systems! Such systems although microscopi- 
cally governed by quantum laws are globally in the realm of classical physics. 
This phenomenon is known as decoherence. High temperature superconduc- 
tivity 13 and double-slit interference patterns for larger and larger molecules 
like fuUerenes Cgo 1 , fluorofuUerenes C60-F48, and even biomolecules (meta- 
tetraphenylporphyrin CinH^oNi involved in hemoglobin transfers) observed 
lately, demonstrate coherent mesoscopic quantum behaviour. 

Several authors advocated that similar emerging mesoscopic coherent 
quantum behaviour can arise in some circumstances in biology J,3, 16, l2|. 
It is therefore worth studying biomolecules as quantum mesoscopic systems. 
Notice however that we don't claim to solve Schrodinger equation for every 
internal degree of freedom of a given biomolecule, but rather study the im- 
plication of emerging mesoscopic quantum behaviour with very few quantum 
degrees of freedom. 

Why? All quantum phenomena at the size of a biomolecule in vivo, if any, 
are necessarily fragile emergent ones, very easily returning to the classical 
realm. It is estimated that for some biomolecules coherence can persist up to 
10~^s; in I13J this estimate is used to explain the rapidity of adaptive mutation 
occurring in Escherichia coli in lactose environment. It is in fact known that 
the time needed for the combinatorial exploration of the configuration space 
is exponential in the length of the searched sequence for classical search but 
only polynomial for quantum search [51 [Tj- Moreover, hitting time can be 
described as a partial measurement process in quantum mechanics. Hence, 
quantum evolution and quantum measurement, provide really new insight 
and explanation's for biological phenomena, strictly inside the known laws of 
Nature: quantum evolution explains the observed rapidity of phenomena like 
adaptive mutation or emergence of life; quantum measurement explains the 
process of selection of fittest or the adaptation to the environment. 
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4 Quantum Direct and Dual Descriptions: Rays and 
Quantum States 

We give here an elementary and very concise introduction to quantum de- 
scription. The interested reader can consult any standard book on the subject 
(for instance JW) or the the freely available lecture notes JB] on the home 
page of the author. 

Quantum configuration spaces are complex separable Hilbert spaces. For 
our purposes, the quantum sequence configuration space is the Hilbert space 
H = ^^(A*). An element of H is a complex function ip on A* such that 
J2a£A' < ^^'^ scalar product is defined for all 0, i/; G H by 

{ (j) \ tjj ) = X^aGA* 0(Q!)V'('^)- An orthonormal basis of this space is provided 
by a family {ea)aGA* of elements of H, indexed by the classical configurations. 
Therefore, basis elements are functions Gq, defined by ea{/3) — Sa,i3, for every 
a, /3 £ A*, where 6 denotes the Kronecker symbol; this basis is isomorphic 
to A*. Every element if; G M can be decomposed as ijj — J^aeA* ^(Q^)ea- 
Quantum configurations are rays, i.e. vectors £M oi unit norm. 

Evolution is a unitary operator U acting on H, i.e. verifying U*U = UU* = 
1. Due to the linear structure of the space, it is enough to study the action of 
U on the basis elements: it becomes then an infinite unitary matrix. 

Quantum observables are self-adjoint operators X acting on H, i.e. verify- 
ing X* — X. On the basis vectors, they are represented by infinite self-adjoint 
matrices. The most elementary observables are projections (the quantum ana- 
logue of indicators); the spectral theorem establishes the decomposition of any 
self-adjoint operator in terms of projections. 

Quantum states, p, are self-adjoint, positive, trace class, normalised oper- 
ators acting on H, called density matrices. Quantum observables are merely 
non-commutative random variables in the sense that the expectation of X 
in state p is given by KX = tr{pX). Classical probability theory is a special 
case of quantum mechanics where all observables and states are represented 
by diagonal matrices. 

The measurement process is what renders quantum mechanics so counter- 
intuitive. Measurement corresponds to asking a question on the values an 
observable can take. Suppose that we consider an observable X taking a 
discrete set of values {Xi)i. Therefore, we have X = J2i^i-Pi^ where classi- 
cally Pi — l[x=\i} while quantum mechanically Pi is the projector to the 
eigenspace corresponding to the eigenvalue A^. Now perform the measure- 
ment in the classical state (probability measure) or the quantum state (den- 
sity matrix) p and consider the state after measurement in the two following 
situations: 

1. The result is filtred to get a conditioning on the fact that a particular 
value Xj has been observed after measurement: classically we get 
/?(•) /3('|^ = Xj), quantum mechanically p i-^ tf/ ^PO • 
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2. The result is not filtred: classically we get, by virtue of Bayes' formula 



If p and X are not simultaneously diagonalisable, the state after measurement 



5 Grammars and Languages 

Grammars are powerful tools to drastically reduce the size of the available 
configuration space by introducing combinatorial constraints by a set of ele- 
mentary transformations acting on the configurations (classical or quantum). 
In the sequel we use the symbol a to denote indistinguishably the classical 
configuration a in the classical case and the basis element Bq in the quantum 
case. 

Grammars act on configurations in differents ways: sequentially in the case 
of generational grammars, in parallel for the so-called Lindemayer systems, or 
in an asynchronous way. 

5.1 Generational Grammars and their Classification 

Initially introduced for the description and analysis of natural languages [Sj, 
generational grammars are extensively used nowadays as models of computa- 
tion. A generational grammar r' is a small set of rules that act sequentially on 
elements of A* to produce a distinguished subset of A* , called the language 
L{r) generated by F. More specifically: 

Definition 1. A (generational) grammar F is a quadruple F = (A„, At, 7T, S), 
where 

1. An and At are two disjoint finite sets, the alphabets of non-terminal and 
terminal symbols respectively; we denote by A = A„ U At and to avoid 
trivialities we always assume that A„ ^ 0, 

2. n <Z (A+ \ A^) X A* is a finite set, the productions, and 

3. S G An is the initial symbol or axiom. 

Productions are rules for possible substitutions of subwords of a sequence by 
other subwords. They define a binary relation on A* as follows: 

Definition 2. Let a,/3 £ A*. We say that (3 is directly derivable from a, and 
denote by the binary relation {a f3) ^ {3a' C a; 3/3' C /3 : (a', /?') £ 77), 
where a' C a means that a' is a subword of a. 

Denote by i the transitive closure^ and by A the accessibility relation^ of 
the direct derivability relation. 

^The transitive closure of a binary relation R is the relation = U^i-R". 
^The accessibility relation associated with a binary relation i? on A is the binary 
relation R* defined for a,b e A by: {aR*b) <^ ((a = 6) V (aR^b)). 




without filtering PjpPj ^ p. 
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Definition 3. Let F be a grammar. The language L(r) generated by F is the 
set L{F) = {a G A* : S* A a}. 

We denote by Dom(i7) = {a £ A+ \ A* | 3/3 G A* : [a, 13) G 77} and 
Ran77(a) = {/3 G A*|(a,/3) G il} if a G Dom(i7) and otherwise. 

If for some a G Dom(i7), and some n we have Ran77(a) — {/3^^\ . . . j/S*-"-*}, 
then we use the Backus-Naur shorthand notation a /J^^'l • • • l/?'^"-' to mean 
that (a,/?'^^-'), . . . , (a,/3'-"-') are all the possible productions with first element 
a. Grammars are classified according to their descendance degree and their 
Chomsky (acontextuality) degree. 

The descendance degree, d, is defined by d = max{|Ran77(a)| : a G 
Dom(i7)}. Notice that for all non-trivial grammars d> 1. li d — 1 the gram- 
mar is called deterministic otherwise it is called non- deterministic. It is worth 
noticing that deterministic descendance means that there exists a function 
: Dom(7T) — > A* whose graph is the set 77, i.e. (a,/?) G 77 /3 = (f){a). 
Fixing a given unpdating policy, this function induces a dynamical system 
# : A* — > A* such that the sequence occuring as successive transformations of 
the grammar on the axiom S appear as the trajectory of the dynamical system 
#(S'), ^0(^(5*), .... Such a trajectory is called a computational path and it can 
be finite if the system halts or infinite if it never halts. For non-deterministic 
descendance, there does not exist such a function 0, or more precisely, this 
function is multi-valued. At each step, we must use a branch of this function. 
The branches are assigned a probability vector or a unitary probability ampli- 
tude vector; we speak then of a stochastic or quantum grammar respectively. 
More precisely, if a G Dom(77), stochastic descendance means that with a 
is associated a vector Pq, — {pa.fj,f3 G A*) such that pa^p > 0, Pa^p = if 
(a,/3) ^ 77, and '^pPa.p — 1; quantum descendance means that with a is 
associated a vector ~ {ua.p,P G A*) such that Ua^p G C, Mq./s = if 
{a, 13) ^ 77, and Y^ii l'"a,/3p = 1- 

The second classifying parameter of grammars is their acontextuality or 
Chomsky degree. The different types of acontextuality are described in the 
Table H] 



Chomsky 


Grammar 


All productions (q, 13) of the form 


Recognition 





recursively enumerable 


13 with a G A+ \ Ai , /3 G A* 


eTM 


1 


context-sensitive 


a = a\oi'oi2 with ol G A^, 0102 7^ k. 








(3 = ai/3'a2, [3' k. 


eLBA 


2 


context-free 


a G AJ, /3 G A* 


ePDA 


3 


regular 


a G AJ , /9 G At or /3 G A* X Ai 


ePA 



Table 1. The Chomsky hierarchy of grammars. For every degree of acontextual- 
ity a universal automaton can be used to recognise the language: Turing machines 
(TM), linear nounded automata (LB A), push down automata (PDA), or finite au- 
tomata (FM). For every descendance type, the corresponding automaton acquires a 
prefix e G {D, N, S, Q} meaning that the evolution is deterministic, non-determintic 
(combinatorial), stochastic, or quantum 
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Example 2. A stochastic context-free grammar has been used in jMi, to de- 
scribe the secondary structure of RNA molecule. Its alphabets are At — 
{A,C,G,U} and A„ ~ {5o, . . . ,S'i3}, the initial symbol Sq; its productions 
n are of the form 



So ^ Si 

1S5 — * CSgG 

Sio — » CS'ioG\GSiiC 



Si -+ CS2G\AS2U 
Se ASr 
Sii ^ AS12U 



S2 AS3U 
Sv USrlGSs 
S12 ^ US'i^ 



S3 — >■ S^Sg 

Ss — G\U 
Si3 C 



Si US^A 
Sg ASioU 



A probability vector is associated with every production. Every computational 
path leads to a different realisation of the secondary structure. The elementary 
probability vectors of the productions induce a natural probability measure 
on the set of all possible secondary structures. A particular random realisation 
gives rise to the secondary structure depicted in Fig. |21 



^ S. So U 

S:: G G S- 



Fig. 2. The secondary RNA structure as a particular random realisation derived 
by the generetional context-free stochastic grammar introduced in |19) 



Several other uses of accretion context-free stochastic grammars are re- 
ported in bilogical literature. Accretion dynamics defines a random walk on 
the vertices of the computational paths leading to the words of the language. 
For stochastic context-sensitive grammars, the probability vectors depend on 
the position of the random walk on the subtree of A* . In this situation we 
speak about a random environment. Several very specific models of random 
walks in random environment on trees corresponding to particular classes of 
context-sensitive grammars have been studied in the literature (the interested 
reader can look at [SI ^1 ^| for instance) that allow to obtain useful 
properties for the probability measure on the words of the language in terms 
of ergodic properties of the random walk. However, the classification of these 
random walks is far from being complete and their complete study remains 
an interesting open problem. Other mathematical results concerning random 
walks stemming from generative grammars using more algebraic combinato- 
rial tools have been developped in [5]. 

5.2 Asynchronous Grammar- driven Processes 

A sequence of internal clocks are attached to subwords in Dom(iT); when they 
ring, at random times distributed exponentially, the subword is transformed 
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by a new subword according to the allowed productions. In general, produc- 
tions do not preserve the length of the words. If we denote by Na{a), a e A the 
number of letters a contained in the word a S A", then the passage from gen- 
eration n to n -f- 1 induces a multibranching process whose sub-populations 
behave like Na- To keep evolving words inside the same space we consider 
infinite length words from the very beginning and apply asynchronous evolu- 
tion on the infinite sequence. This procedure is quite standard in statistical 
physics; although mathematically more delicate to handle, the obtained re- 
sults are sharper than the finite case; finite size results can be inferred from 
infinite sequences. 

Example 3. The Fig. |31 illustrates how asynchronous grammar-driven process 
evolves for an infinite initial sequence. The evolution of a fixed window of 
size 2N -|- 1 is depicted in this figure. Notice that since productions are not 
length preserving in general, there does not exist a global coordinate system 
to number the residues. 



t4 


. A 


t3 


. A 


t2 


. A 


tl 


. A 


to = . . 


. A 




-N 



G G C ■ T T 

G G C ■ T T 

G G C ■ T T 

G G C T T T 

G G C T A ■ 



C A T A C G T 

C T T A C G T 

C A T A C G T 

C T T A C G T 

C T T A C G T 

J N 



Fig. 3. A random realisation of a computational path: Productions qiq'q2 — * 
ai/3'a2 occur at random times. At ti: ai = T, a-i = C, a' = A and /3' = TT. At t2: 
Qi = GC, oi2 — TT, ct' = T and 13' — k. etc. If we allow infinite re-numberings in 
order to impose a global coordinate system, at some places the configuration must 
be squeezed. The symbol • reminds where squeezing takes place 



In JI] , stochastic evolution in the absence of a global coordinate system 
has been studied and in this method has been extended of quantum 
evolution in the context of quantum gravity. These results are presented below 
adapted to a genetic context. 

The Classical Stochastic Case 

We consider infinite length configurations in X = A^ and the set f2 of con- 
tinuous time processes H — {lj : [0, oo[^ X admissible}. A process is termed 
admisssible if it is right continuous and if uj{s—) ^ for some s G [0, oof 
then there exist a left semi-infinite word a, a right semi-infinite word 7 
and two finite words (3,(3' with {(3,(3') £ TT such that ljo{s—) = a(3j and 
Lo{s) = a/3'7. Now fix some positive integer N and a finite initial configura- 
tion ^ of length 2iV-|-l. In the Fig.Oabove, iV = 6 and the initial configuration 
^ = AGGCTACTTACGT of length 13. Notice that the symbol • does not 
make part of the initial sequence! Denote by /2jv[?] ^ {uj G f2 : llJ-n{0) — 
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^_jv • • • wjv(O) = ^at} the set of process trajectories emanating from the cyhn- 
der set defined by the fixed configuration ^. 

Since a global coordinate system cannot be used, local observers are placed, 
let say at the original sites i and j (see Fig.|3|; denote by fiN-ij-t the subset 
of such trajectories for which the residues on sites i and j have not been 
modified up to time t while this property is false for sites k with i < k < j. 
The probability rates on the productions induce the probability F{f2N;i,j;t) on 
the set f^N-ij-t- 

Theorem 1. The limit limjv^oo P(^2jV;i.j:t) exists for all i,j G Z, i < < j 
and defines, for all t G [0, oo[, a joint probability fj,{i,j;t) such that 

{l,j}:i<0<3 

Remark 1. Although the above theorem appears as an existence result, as a 
byproduct of the proof, one gets very precise estimates on the properties of 
the probability measure /x. Fixing the grammar, we get estimates of the joint 
probability measure on sequences. 

The Quantum Case 

Productions are implemented by operators; more precisely, suppose that a — 
aia'a2 with = j — 1, for j > 1, and /3 = ai/3'a2, while tt := (a', /?') G 77. 
Then we define an operator A,r(j) by its action on basis vecors: ATr{j)ea = ep 
and similarly its adjoint A*(j)e^ = Cq. These operators define an operator, 
the Hamiltonian, 

Tren jeN 

with Att G C, Att denoting the complex conjugate of A^, c G M, and He a 
diagonal correction term. 

The above Hamiltonian is formally self-adjoint. Therefore the operator 
U{t) = BKpiitH) is formally unitary and corresponds to time evolutions. In- 
troducing the family of Hilbert supspaces H^v = spanjea, |a| < N} and denot- 
ing by P/v the projector to Hat, we get regularised finite length Hamiltonians 
Hn = PmHPjs!- The set of all self-adjoint operators on Hjv constitutes a finite 
dimensional C*-algebra denoted by %n- When passing to the inductive limit 
2too = limjv^ooSlAT, and then to its norm-closure 21 = 2loo we obtain the so- 
called quasi-local algebra of observables 21. Recall that the set of self-adjoint 
operators together with a state is the quantum (non-commutative) analogue 
of random variables. Hence the relevant question is whether a state can be 
defined on the quasi-local algebra 2t. (These are standard constructions in the 
context of quantum statistical mechanics; details can be found for instance in 

a-) 
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Theorem 2. Define pN — t^cxp("-Tg'w ) ' ^ — ^' ^^^''^ ^ state on SIat. 

For every quantum observable X G 21 w;e have limjv^oo ■tr(pjvX) = U{pX), 
where p is a state on the infinite system. 

Remark 2. Giving the productions and the corresponding unitary vector of 
evolution defines a state on the set of observables over infinite length sequences 
by virtue of Theorem 12 

6 Conclusion and Open Problems 

We have presented as simply as possible a formalism based on general gram- 
mars acting on sequence spaces. It is shown that for context-sensitive stochas- 
tic generational grammars, the relevant object to study is a random walk in 
random environment on tree. Asymptotic behaviour for such objects is known 
only for very special models. It is a challenging open problem to have a more 
complete classification of these random walks. 

For quantum grammars, the relevant objects are quantum random walks. 
Now the known models of random walks are essentially only onc-dimcnsional. 
Thus there are challenging open problems even for context-free and regular 
quantum grammars corresponding to quantum random walks on trees. 

Then we have presented the case of asynchronous (random or quantum) 
grammar-driven processes on infinite sequences and stated known results es- 
tablishing the existence of a joint state on the infinite-dimensional algebra of 
observables. These results show that if we know the grammar, there exist a 
global state that stems from this grammar. 

There are several open challenging problems in this context. Firstly, from 
experimental observations on very long sequences, we can estimate correlation 
properties of the state. Is it possible to reconstruct the Hamiltionian (hence the 
grammar) giving rise to this state? A second important problem is unicity : is it 
true that a given Hamiltonian gives rise to a unique state or some phenomenon 
of phase transition occurs? The consequences of a phase transition would 
be that the system becomes unstable; although two different cells share the 
same grammar, it is enough that some very small external perturbation acts 
differently on each of them for their genetic sequence to evolve (mutate) to 
different states. 

The above mentionned problems are essentially mathematical in nature. 
There are however several biological and algorithmic problems associated with 
them. The fundamental thesis defended in this work is that the genome statis- 
tics of a class of individuals of a given species is determined by the stochastic 
or quantum grammar inducing the asynchronous process. Assuming absence 
of phase transition and that for individuals belonging to two different classes 
(that can be distinguished for instance by an experimentally observed spec- 
tacular difference in the reaction to a drug) the genome statistics can be 
discriminated, it follows that the determining grammars must be different. 
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In order to accept this thesis, the algorithmic problems must be solved and 
precisely designed biological experiments must confirm it. But if it is even- 
tually established, it provides a mesoscopic explanatory scheme involving the 
fundamental mechanisms that govern the time evolution of the DNA molecule. 
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